Consideration of Site-wise Confidence in Fuzzy Co-clustering of Vertically Distributed Cooccurrence Data

نویسندگان

  • Toshiya Oda
  • Katsuhiro Honda
  • Seiki Ubukata
  • Akira Notsu
چکیده

In advanced information and telecommunications network society, it is expected to utilize big data distributed among various organizations, such as cooperation groups, state organs and allied countries, with the goal of revealing intrinsic knowledge. In such collaborative data mining, however, personal privacy must be strictly preserved. This paper deals with a possible approach for utilizing distributed cooccurrence information in fuzzy coclustering context under privacy consideration. Fuzzy Clustering for Categorical Multivariate data (FCCM) is a basic fuzzy co-clustering model and have been extended so as to perform privacy preserving data analysis. The secure model is further improved in this paper so that we can find robust knowledge, which is free from the influences of unreliable site, considering site-wise confidences. The applicability of the proposed model is demonstrated in several numerical experiments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Entropy-based Consensus for Distributed Data Clustering

The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...

متن کامل

DISTRIBUTED AND COLLABORATIVE FUZZY MODELING

In this study, we introduce and study a concept of distributed fuzzymodeling. Fuzzy modeling encountered so far is predominantly of a centralizednature by being focused on the use of a single data set. In contrast to this style ofmodeling, the proposed paradigm of distributed and collaborative modeling isconcerned with distributed models which are constructed in a highly collaborativefashion. I...

متن کامل

Improving Imbalanced data classification accuracy by using Fuzzy Similarity Measure and subtractive clustering

 Classification is an one of the important parts of data mining and knowledge discovery. In most cases, the data that is utilized to used to training the clusters is not well distributed. This inappropriate distribution occurs when one class has a large number of samples but while the number of other class samples is naturally inherently low. In general, the methods of solving this kind of prob...

متن کامل

Higher-order Co-occurrence Features based on Discriminative Co-clusters for Image Classification

Co-occurrence based image features have attracted keen attentions due to the promising performances for image classification tasks [1, 2, 3, 6, 7]. For extracting the co-occurrences, it is common to transform the quantitative data into qualitative data (symbols) by means of quantization (clustering) at first; e.g., continuous gradient orientation is coded into orientation bins [3], RGB colors a...

متن کامل

Fuzzy Co-Clustering and Application to Collaborative Filtering

Cooccurrence information analysis became more popular in many web-based system analyses such as document analysis or purchase history analysis. Rather than the conventional multivariate observations, each object is characterized by its cooccurrence degrees with various items, and the goal is often to extract co-cluster structures among objects and items, such that mutually familiar object-item ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016